Naïve Bayes Classifier for Arabic Word Sense Disambiguation

Authors

  • Samir Elmougy
  • Taher Hamza
  • Hatem M. Noaman
Abstract

Word Sense Disambiguation (WSD) is the process of selecting the sense of an ambiguous word in a given context from a set of predefined senses. The sense inventory usually comes from a dictionary or thesaurus. In Arabic, the main cause of word ambiguity is the lack of diacritics in most digital documents, so the same written word can occur with different senses. In this paper, we use a rooting algorithm with a Naïve Bayes (NB) classifier to resolve the ambiguity of non-diacritized words in Arabic. Our experimental study shows that using the rooting algorithm with the NB classifier improves accuracy by 16% and also reduces the dimensionality of the training documents.
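
The combination described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the root table `TOY_ROOTS`, the helper `to_root`, the class `NaiveBayesWSD`, and the English placeholder tokens are all hypothetical stand-ins for a real Arabic rooting algorithm and corpus, and add-one (Laplace) smoothing is an assumption. The sketch only shows how mapping context words to roots before training shrinks the feature vocabulary that a Naïve Bayes sense classifier has to estimate.

```python
import math
from collections import Counter, defaultdict

# Hypothetical root table standing in for a real Arabic rooting algorithm.
TOY_ROOTS = {
    "students": "study", "studied": "study", "studying": "study",
    "flags": "flag", "raised": "raise", "raising": "raise",
}

def to_root(token):
    """Return the toy root of a token, or the token itself if unknown."""
    return TOY_ROOTS.get(token, token)

class NaiveBayesWSD:
    """Multinomial Naive Bayes over context words (assumed add-alpha smoothing)."""

    def __init__(self, use_roots=True, alpha=1.0):
        self.use_roots = use_roots
        self.alpha = alpha

    def _features(self, context):
        return [to_root(t) if self.use_roots else t for t in context]

    def fit(self, contexts, senses):
        self.sense_counts = Counter(senses)
        self.word_counts = defaultdict(Counter)
        self.vocab = set()
        for context, sense in zip(contexts, senses):
            feats = self._features(context)
            self.word_counts[sense].update(feats)
            self.vocab.update(feats)
        return self

    def predict(self, context):
        total = sum(self.sense_counts.values())
        # Ignore features never seen in training.
        feats = [f for f in self._features(context) if f in self.vocab]
        best_sense, best_score = None, -math.inf
        for sense, count in self.sense_counts.items():
            score = math.log(count / total)  # log prior
            denom = sum(self.word_counts[sense].values()) + self.alpha * len(self.vocab)
            for f in feats:
                score += math.log((self.word_counts[sense][f] + self.alpha) / denom)
            if score > best_score:
                best_sense, best_score = sense, score
        return best_sense

# Toy training contexts for one ambiguous word with two made-up sense labels.
contexts = [
    ["students", "studied", "book"],
    ["teacher", "studying", "lesson"],
    ["flags", "raised", "pole"],
    ["soldier", "raising", "flag"],
]
senses = ["science", "science", "flag", "flag"]

rooted = NaiveBayesWSD(use_roots=True).fit(contexts, senses)
surface = NaiveBayesWSD(use_roots=False).fit(contexts, senses)

print(len(surface.vocab), len(rooted.vocab))
print(rooted.predict(["studying", "students"]))
```

On this toy data the rooted model keeps 8 vocabulary entries where the surface-form model keeps 12, which is the dimensionality effect the abstract refers to. The accuracy figure reported in the abstract comes from the authors' own experiments and is not reproduced by this sketch.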

Similar resources

A Naïve Bayes Approach for Word Sense Disambiguation

The word sense disambiguation (WSD) is the task of automatically selecting the correct sense given a context and it helps in solving many ambiguity problems inherently existing in all natural languages. Statistical Natural Language Processing (NLP), which is based on probabilistic, stochastic and statistical methods, has been used to solve many NLP problems. The Naive Bayes algorithm which is one o...

Using Fuzzifiers to Solve Word Sense Ambiguation in Arabic Language

Text mining techniques confront many challenges when dealing with the Arabic language including lexical disambiguation because Arabic is a highly inflectional and derivational language, most of the Arabic texts are devoid of diacritics especially Modern Standard Arabic (MSA), thus, it is a must to depend on the ambiguous word context under study. Two fuzzy logic classifiers have been built and ...

Lexical Disambiguation of Arabic Language: An Experimental Study

In this paper we test some supervised algorithms that most of the existing related works of word sense disambiguation have cited. Due to the lack of linguistic data for the Arabic language, we work on non-annotated corpus and with the help of four annotators; we were able to annotate the different samples containing the ambiguous words. Since that, we test the Naïve Bayes algorithm, the decisio...

Modeling Consensus: Classifier Combination for Word Sense Disambiguation

This paper demonstrates the substantial empirical success of classifier combination for the word sense disambiguation task. It investigates more than 10 classifier combination methods, including second order classifier stacking, over 6 major structurally different base classifiers (enhanced Naïve Bayes, cosine, Bayes Ratio, decision lists, transformation-based learning and maximum variance boost...

Exemplar-Based Word Sense Disambiguation: Some Recent Improvements

In this paper, we report recent improvements to the exemplar-based learning approach for word sense disambiguation that have achieved higher disambiguation accuracy. By using a larger value of k, the number of nearest neighbors to use for determining the class of a test example, and through 10-fold cross validation to automatically determine the best k, we have obtained improved disambiguation ...

Publication date: 2008